
' . ^K^TO' REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE'CflOECKER 

'.■/'■'•■: : ^v:- -VERSION 3.0 PROGRAM . ACCESSIBLE THROUGH THE U.S. PATENT AND 

TRADEMARK OFFICE WEBSITE. SEE BELOW: ClA ^ - v. . v 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 

Checker Version 3.0 can be down loaded from the USPTO website at the following address: 

http://www.uspto.gov/web/offices/pac/checker 
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1646 



RAW SEQUENCE LISTING DATE: 06/2 3/200 0 

PATENT APPLICATION: US/09/005 f 3 18B TIME: 06:20:30 

Input Set : A:\401cl.app 

Output Set: N:\CRF3\06222000\I005318B.raw 



4 (1) GENERAL INFORMATION: 



SEQUENCE LISTING Does Not Comnh, 

" Corrected Diskette Needed 



c--> 


6 


<i> 


APPLICANT: Hein, Mich B. 




7 




Hiatt, Andrew C. 




8 




Fitchen, John H. 




10 


(ii) 


TITLE OF INVENTION: NOVEL EPITHELIAL TISSUE TARGETING 




12 


(iii) 


NUMBER OF SEQUENCES: 140 




14 


(iv) 


CORRESPONDENCE ADDRESS: 




15 




(A) ADDRESSEE: SEED IP LAW GROUP PLLC 




16 




(B) STREET: 6300 Columbia Center, 701 Fifth Avenue 




17 




(C) CITV: Seattle 




18 




(D) STATE: Washington 




19 




(E) COUNTRY: USA 




20 




( F ) ZIP: 98104 




22 


(v) 


COMPUTER READABLE FORM: 




23 




(A) MEDIUM TYPE: Floppy disk 




24 




(B) COMPUTER: IBM PC compatible 




25 




(C) OPERATING SYSTEM: PC-DOS/MS-DOS 




26 




(D) SOFTWARE: Patent In Release #1.0, Version #1.30 


c--> 


28 


(vi) 


CURRENT APPLICATION DATA: 


c--> 


29 




(A) APPLICATION NUMBER: US/09/005 , 318B 


c-~> 


30 




(B) FILING DATE: 09-Jan-1998 




31 




(C) CLASSIFICATION: 




33 


(viii) 


ATTORNEY/AGENT INFORMATION: 




34 




(A) NAME: Sharkey, Richard G. 




35 




(B) REGISTRATION NUMBER: 3 2,62 9 




36 




(C) REFERENCE/DOCKET NUMBER: 310 098.401C1 




38 


(ix) 


TELECOMMUNICATION INFORMATION: 




39 




(A) TELEPHONE: (206) 622-4900 




40 




(B) TELEFAX: (206) 682-6031 



AGENT 



ERRORED SEQUENCES 

E--> 4 72 (2) INFORMATION FOR SEQ ID NO: 

4 59 (i) SEQUENCE CHARACTERISTICS: 

475 (A) LENGTH: 286 base pairs 

476 (B) TYPE: nucleic acid 

477 (C) STRANDEDNESS : single 

478 (D) TOPOLOGY: linear 
4 67 (ix) FEATURE: 
4 68 (A) NAME/KEY: CDS 
469 (B) LOCATION: 1. . 282 
4 80 (ix) FEATURE: 

4 81 (A) NAME/KEY: CDS 

482 (B) LOCATION: 1..282 

E--> 4 88 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/005 , 3 18B 



DATE: 06/23/2000 
TIME: 06:20:30 



Input Set : A:\401cl.app 



W--> 



w-- > 
w-- > 

E--> 




W--> 



w--> 











Output Set: 


N:\CRF3\06222000\I005318B.raw 






490 


GAT 


CAG 


AAG 


TGC 


AAG 


TGT 


GCT 


CGT 


ATT 


ACT 


TCT 


AGA 


ATC 


ATC 


CGT 


AGC 


4 91 


Asp 


Gin 


Lys 


Cys 


Lys 


Cys 


Ala 


Arg 


He 


Thr 


Ser 


Arg 


He 


He 


Arg 


Ser 


492 


15 










20 










25 










3 0 


494 


TCA 


GAG 


GAC 


CCA 


AAT 


GAA 


GAT 


ATA 


GTC 


GAA 


CGT 


AAC 


ATC 


CGT 


ATC 


ATC 


4 95 


Ser 


Glu 


Asp 


Pro 


Asn 


Glu 


Asp 


He 


Val 


Glu 


Arg 


Asn 


He 


Arg 


He 


He 


4 96 










35 










40 










4 5 




498 


GTC 


CCA 


CTG 


AAT 


AAC 


CGG 


GAG 


AAT 


ATC 


TCA 


GAT 


CCT 


ACA 


AGT 


CCG 


TTG 


4 99 


Val 


Pro 


Leu 


Asn 


Asn 


Arg 


Glu 


Asn 


He 


Ser 


Asp 


Pro 


Thr 


Ser 


Pro 


Leu 


500 








50 










55 










60 






502 


CGC 


ACA 


CGC 


TTC 


GTA 


TAC 


CAC 


CTG 


TCA 


GAT 


CTG 


TGT 


AAG 


AAG 


TGT 


GAT 


50 3 


Arg 


Thr 


Arg 


Phe 


Val 


Tyr 


His 


Leu 


Ser 


Asp 


Leu 


Cys 


Lys 


Lys 


Cys 


Asp 


504 






65 










70 










75 








506 


CCA 


ACA 


GAG 


GTA 


GAG 


CTG 


GAC 


AAT 


CAG 


ATA 


GTC 


ACT 


GCG 


ACT 


CAA 


AGC 


507 


Pro 


Thr 


Glu 


Val 


Glu 


Leu 


Asp 


Asn 


Gin 


He 


Val 


Thr 


Ala 


Thr 


Gin 


Ser 


508 




80 










85 










90 










510 


AAC 


ATT 


TGC 


GAT 


GAG 


GAC 


AGC 


GCT 


ACA 


GAA 


ACC 


TGC 


TAC 


TGA 






511 


Asn 


He 


Cys 


Asp 


Glu 


Asp 


Ser 


Ala 


Thr 


Glu 


Thr 


Cys 


Tyr 








512 


95 










100 










105 












514 


ATTC 






























517 


(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 


L4: 
















519 




(i) 


SEQUENCE CHARACTERISTICS: 
















520 






(A) LENGTH: 105 base pairs 














521 






(B) TYPE: 


nucleic 


acid 
















522 






(C) STRANDEDNESS : 


single 
















523 






(D) TOPOLOGY: 


linear 


















525 




(ix) 


FEATURE: 
























526 






(A) NAME/KEY: 


CDS 




















527 






(B) LOCATION: 


1, .105 


















533 




<xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 14: 










535 


GAT 


CTG 


TGT 


AAG 


AAG 


TGT 


GAT 


CCA 


ACA 


GAG 


GTA 


GAG 


CTG 


GAC 


AAT 


CAG 


536 


Asp 


Leu 


Cys 


Lys 


Lys 


Cys 


Asp 


Pro 


Thr 


Glu 


Val 


Glu 


Leu 


Asp 


Asn 


Gin 


537 


95 










100 










105 










110 


539 


ATA 


GTC 


ACT 


GCG 


ACT 


CAA 


AGC 


AAC 


ATT 


TGC 


GAT 


GAG 


GAC 


AGC 


GCT 


ACA 


540 


He 


Val 


Thr 


Ala 


Thr 


Gin 


Ser 


Asn 


He 


Cys 


Asp 


Glu 


Asp 


Ser 


Ala 


Thr 


541 










115 










120 










125 




543 


GAA 


ACC 


TGC 




























544 


Glu 


Thr 


Cys 




























1191 


(2) 


INFORMATION FOR SEQ ID 


NO: 


45: 
















1193 




(i) SEQUENCE CHARACTERISTICS: 
















1194 






(A) LENGTH: 16 amino 


acids 














1195 






(B) TYPE 


amino acid 


















1196 






(C) STRANDEDNESS 




















1197 






(D) TOPOLOGY 


linear 


















1203 




(xi) SEQUENCE DESCRIPTION: 


SEQ 


ID NO: 45: 










1205 



































48 



96 



144 



192 



240 



282 



286 



RECEIVED 

jul mm 

TECH CENTER 1660/2908 



48 



96 



105 



Ala He Gin Asp Pro Arg Leu Phe Ala Glu Glu Lys Ala Val Ala Asp 

2303 (2) INFORMATION FOR SEQ ID NO: 111: 
23 05 (i) SEQUENCE CHARACTERISTICS : 

2306 (A) LENGTH: 30 amino acids 

2307 (B) TYPE: amino acid 



c/ 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/00 5 , 318B 



DATE : 06/23/2000 
TIME : 06:20:31 



E--> 



E--> 



2308 
2309 
2314 
2316 
2317 
2319 
2320 
2322 
2324 
2325 
2326 
2327 
2328 
2334 
2336 
2337 
2339 
2340 



Input Set : A:\401cl.app 

Output Set: N:\CRF3\06222000\l005318B.raw 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 
Val Arg Asp Gin Ala Gin Glu Asn Arg Ala Ser Gly Asp Ala Gly 
1 5 10 15 

Ser Ala Asp Gly Gin Ser Arg Ser Ser Ser Ser Lys Val Leu Phe 
16 20 25 30 

(2) INFORMATION FOR SEQ ID NO: 112: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
SEQUENCE DESCRIPTION: 



c- 



(xi) 



SEQ ID NO: 112: 

Val Pro Ser Thr Pro Pro Thr Pro Ser Pro Ser Thr Pro Pro Thr 
1 5 10 15 

Pro Ser Pro Ser Cys Cys His Pro Arg Leu 
16 20 25 
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VERIFICATION SUMMARY 

PATENT APPLICATION : US/09/005 , 3 18B 



DATE: 06/23/2000 
TIME: 06:20:32 



Input Set : A:\401cl.app 

Output Set: N:\CRF3\06222000\l0053L8B.raw 



L:6 M:220 C: Keyword misspelled or invalid format, [(i) APPLICANT : ] 

L:28 M:220 C: Keyword misspelled or invalid format, [(vi) CURRENT APPLICATION DATA:] 

L:29 M:220 C: Keyword misspelled or invalid format, [ ( A) APPLICATION NUMBER : ] 

L:30 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:] 

L:228 M:341 W: (46) "n" or "Xaa" used, for SEQ IDtt : 5 

L:237 M:341 W: (46) "n" or "Xaa" used, for SEQ ID* : 5 

L:433 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 11 

L:480 M:201 W: Mandatory field data missing, SeqNo=13, [INFORMATION FOR SEQ ID NO:] 

L:480 M:202 E: (16) Value must be an Integer, Data=[] 

L:488 M:212 E: (34) Invalid or duplicate Sequence ID Number, Data=[13:] 
L:492 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:-1 
L:496 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:-1 
L:500 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:-1 
L:504 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:-1 
L:508 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:-1 
L:512 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:-1 
L:517 M:216 E : (34) Seq.#s missing, 15 thru 13 

L:537 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 14 
L:541 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 14 
L:1148 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 42 
L:1152 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 42 
L:1205 M:203 E: No. of Seq . differs, LENGTH : Input : 16 Found: 0 SEQ: 45 
L:2317 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:111 
M:332 Repeated in SeqNo=lll 

L:2337 M:332 E : (32) Invalid/Missing Amino Acid Numbering, SEQ ID: 112 
M:332 Repeated in SeqNo=112 

L:2486 M:341 W: (46) n n" or "Xaa" used, for SEQ ID#:118 
L:2489 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:118 
L:2640 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:123 
L:2646 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:123 
L:2649 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:123 
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STATISTICS SUMMARY 

PATENT APPLICATION: US/09/00 5 r 318B 



DATE: 06/23/2000 
TIME: 06:20:32 



Input Set : A:\401cl.app 

Output Set: N:\CRF3\06222000\I005318B.raw 



Application Serial Number: 03/09/005,3186 
Alpha or Numeric : Alpha 
Application Class: 514 
Application File Date: 01-09-1998 
Art Unit: 1646 

Software Application: Patentln 
Total Number of Sequences: 140 
Number of Errors : 8 
Number of Warnings: 19 
Number of Corrections : 4 

MESSAGE SUMMARY 

201 W: 1 (Mandatory field data missing) 

202 E: 1 ((16) Value must be an Integer) 

203 E: 1 (No. of Seq . differs) 

212 E: 1 ((34) Invalid or duplicate Sequence ID Number) 
216 E: 1 ((34) Seq.#s missing) 

220 C: 4 (Keyword misspelled or invalid format) 

332 E: 4 ((32) Invalid/Missing Amino Acid Numbering) 

341 W: 7 ((46) "n" or "Xaa" used) 

336 W: 11 (Invalid Amino Acid Number in Coding Region) 
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